A Forward-backward Algorithm for Stochastic Control Problems - Using the Stochastic Maximum Principle as an Alternative to Dynamic Programming
نویسندگان
چکیده
An algorithm for solving continuous-time stochastic optimal control problems is presented. The numerical scheme is based on the stochastic maximum principle (SMP) as an alternative to the widely studied dynamic programming principle (DDP). By using the SMP, (Peng, 1990) obtained a system of coupled forwardbackward stochastic differential equations (FBSDE) with an external optimality condition. We extend the numerical scheme of (Delarue and Menozzi, 2006) by a Newton-Raphson method to solve the FBSDE system and the optimality condition simultaneously. As far as the authors are aware, this is the first fully explicit numerical scheme for the solution of optimal control problems through the solution of the corresponding extended FBSDE system. We discuss possible numerical advantages to the DDP approach and consider an optimal investment-consumption problem as an example.
منابع مشابه
Optimal Portfolio Allocation of Commodity Related Assets in Illiquid Markets and a Forward-backward Algorithm to Solve the Stochastic Control Problem
In the first part of the talk, an algorithm for solving continuous‐time stochastic optimal control problems is presented. The numerical scheme is based on the stochastic maximum principle (SMP) as an alternative to the widely studied dynamic programming principle (DPP). We show possible performance advantages of the algorithm in the case of feedback control. In the second...
متن کاملRobust inter and intra-cell layouts design model dealing with stochastic dynamic problems
In this paper, a novel quadratic assignment-based mathematical model is developed for concurrent design of robust inter and intra-cell layouts in dynamic stochastic environments of manufacturing systems. In the proposed model, in addition to considering time value of money, the product demands are presumed to be dependent normally distributed random variables with known expectation, variance, a...
متن کاملExpected Duration of Dynamic Markov PERT Networks
Abstract : In this paper , we apply the stochastic dynamic programming to approximate the mean project completion time in dynamic Markov PERT networks. It is assumed that the activity durations are independent random variables with exponential distributions, but some social and economical problems influence the mean of activity durations. It is also assumed that the social problems evolve in ac...
متن کاملStochastic Dynamic Programming with Markov Chains for Optimal Sustainable Control of the Forest Sector with Continuous Cover Forestry
We present a stochastic dynamic programming approach with Markov chains for optimal control of the forest sector. The forest is managed via continuous cover forestry and the complete system is sustainable. Forest industry production, logistic solutions and harvest levels are optimized based on the sequentially revealed states of the markets. Adaptive full system optimization is necessary for co...
متن کاملAn Application of the Stochastic Optimal Control Algorithm (OPTCON) to the Public Sector Economy of Iran
In this paper we first describe the stochastic optimal control algorithm called ((OPTCON)). The algorithm minimizes an intertemporal objective loss function subject to a nonlinear dynamic system in order to achieve optimal value of control (or instrument) variables. Second as an application, we implemented the algorithm by the statistical programming system ((GAUSS)) to determine the optimal fi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012